Enhanced gene expression in algae

ABSTRACT

The invention provides eukaryotic unicellular algae engineered to express a nucleosome alteration protein fused to a protein with affinity to the DNA binding site acting in coordination. An example is a LexA-p300 fusion protein, where the p300 is derived from  Chlamydomonas . The LexA binding domain guides the p300 to the binding site and the p300 loosens the nucleosome structure by acetylating histones within proximity of the transgene, thus remodeling the local chromatin structure to allow for high-level expression.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No. 13/504,741, entitled “ENHANCED GENE EXPRESSION IN ALGAE” filed Apr. 27, 2012, which issued as U.S. Pat. No. 8,476,019, on Jul. 2, 2013, and which is a national phase of PCT/US2010/055012, entitled “ENHANCED GENE EXPRESSION IN ALGAE” filed Nov. 1, 2010, which claims the priority benefit under 35 U.S.C. §119(e) of U.S. Provisional Application Ser. No. 61/256,921, entitled “ENHANCED GENE EXPRESSION IN ALGAE” filed Oct. 30, 2009, which are both incorporated herein by reference in their entirety.

BACKGROUND OF THE INVENTION

1. Field of the Invention

This invention relates to the field of molecular biology and in particular to the expression of transgenes in algae.

2. Description of the Background

Transgenes are foreign DNA sequences introduced into genomes, in the case of eukaryotic cells within the chromosomes. These genes are usually transcribed as any other gene of the host. Transcription is generally controlled by the chromatin structure that packs the chromosome's DNA into tight bundles in eukaryotic organisms called nucleosomes. As the chromatin structure around a specific gene relaxes, the DNA of the particular gene becomes accessible to the transcription machinery of the cell. Staining indicates that actively transcribed genes in eukaryotes are more loosely incorporated in nucleosomes and more prevalent in euchromatin. In some instances, transgenes are incorporated into the host's chromosome but fail to be expressed due to unfavorable chromatin structures. This phenomenon is called “gene silencing.” The ability to control how tightly a nucleosome is packed can help enhance the expression of transgenes in host cells. In mammalian cells, it has been proposed that coupling transgene expression with increased availability of a histone “tail” modifying gene, p300 (also known as a histone acetyl transferase, or “HAT”; in the family of CREB binding proteins, or “CBP”), can increase the expression level, presumably because the acetyl transferase activity causes a looser histone-DNA association and allows transcription factors access to the genes. T. H. J. Kwaks et al., J. Biotechnology, 115:35-46 (2005).

Microalgae encompass a broad range of organisms, mostly unicellular aquatic organisms. The unicellular eukaryotic microalgae (including green algae, diatoms, and brown algae) are photosynthetic and have a nucleus, mitochondria and chloroplasts. The chromatin structure in algae is distinct from other eukaryotes. The chromatin in algae stains heavily, indicating a more compact nucleosome structure and tight association of the DNA to the histones. These differences in chromatin structure of microalgae, particularly in green algae, suggest distinct mechanism of histone chromatin regulation of gene expression.

These differences in eukaryotic microalgae chromatin structure may be the factor behind the observation that stable nuclear transgene expression in microalgae is difficult and transient due to chromatin mediated gene silencing. H. Cerutti, A.M.J., N. W. Gillham, J. E. Boynton, Epigenetic silencing of a foreign gene in nuclear transformants of Chlamydomonas, The Plant Cell 9:925-945 (1997). When genetic constructs comprising a mammalian derived anti-apoptotic gene and a fluorescent reporter gene were previously introduced by the present inventors in algae, the expression levels were low and no expression of the fluorescence gene was detected, thus confirming that transgenes are difficult to express in algae.

Algae are considered an important source of healthy nutrients for human consumption and are important as biomass and biofuels. Genetic engineering and stable (over multiple generations) expression of transgenes would open new horizons and greatly enhance the value and desirability to beneficially culture algae. However, as noted above, stable and sufficiently high level of gene expression has been difficult to achieve. A method to improve transgene expression in algae and make that expression stable would be very useful. Such an approach would need to account for the uniquely robust histone mediated gene silencing of microalgae including green algae.

SUMMARY OF THE INVENTION

In accordance to one embodiment, the invention provides a system for enhanced gene expression in algae, the system comprising:

-   -   an algae compatible transcriptional promoter functionally         upstream of a coding sequence for a gene expression enhancer         (GEE) fusion protein, wherein the fusion protein comprises an         algae derived p300 functionally fused to the DNA binding         protein, wherein at least the portion of the coding sequence of         the DNA binding protein domain is codon optimized for improved         expression in an algae;     -   at least one transgene functionally downstream of an algae         compatible transcriptional promoter; and     -   at least one DNA region that is a binding site for the DNA         binding protein, in vicinity of at least one of said         transcriptional promoters;     -   wherein said system resides in an algae.

In a preferred embodiment, the DNA binding protein is LexA DNA Binding domain. In another preferred embodiment, the p300 part of the GEE fusion protein is from Chlamydomonas reinhardtii. In a more preferred embodiment, only a HAT domain of the p300 protein is part of the GEE fusion protein. The p300 or only the HAT domain of p300 may be derived from homologs of other microalgae including green algae in addition to Chlamydomonas reinhardtii.

In accordance to another embodiment, the transgene is codon modified for improved expression in algae. In a preferred embodiment, the transgene or gene of interest (GOI) is a fluorescence-Bcl-x_(L) fusion gene. The fusion protein may include a fluorescence-Bcl-x_(L) construct (e.g. YFP-Bcl-x_(L) fusion or a Venus-Bcl-x_(L) fusion). In another preferred embodiment, the transgene is the YFP/Venus gene, not necessarily part of a fusion protein. Venus is an enhanced yellow fluorescent protein (YFP) that is stable over a wide range of pH, folds quickly, and emits at 30-fold the intensity of conventional YFP. Nagai T., Ibata K., Park E. S., Kubota M., Mikoshiba K. and Miyawaki A. (2002). A variant of yellow fluorescent protein with fast and efficient maturation for cell-biological applications. Nature Biotechnol, 20, 87-90.

In accordance to another embodiment, the system further comprises at least one selective marker such as an antibiotic resistance marker. In a preferred embodiment, the GEE fusion protein and the at least one transgene are introduced into the system on one vector and structurally arranged to be expressed from one bidirectional promoter region and comprising DNA binding sites in the vicinity of both promoters. In a more preferred embodiment, the GEE fusion protein and the transgene are introduced in the system on separate vectors, each comprising a selective marker and the selective markers are not the same. When separate vectors, both the GEE vector and the vector for the gene of interest (GOI) will contain selective markers. When the GEE is introduced on a separate vector from the vector for the GOI, the GEE vector may be used to generate a stable algae cell line that will serve as the recipient for the second vector expressing the GOI. This stable GEE algae cell line will function to enhance the expression of the second vector containing the GOI.

In accordance to yet another embodiment, the algae compatible transcriptional promoters are hsp70, rbcS, nitA, actin, tubA2 or a combination thereof.

In accordance to another yet embodiment, the GEE fusion protein comprises a DNA binding domain functionally fused to an algae derived p300 homologue having at least 80% identity over the HAT region to the p300 from Chlamydomonas reinhardtii. Preferably, the GEE fusion protein comprises a DNA binding domain functionally fused to the HAT domain of the HAT region to the p300 from Chlamydomonas reinhardtii. It is noteworthy that the p300 from mammalian species is much larger in size and is much less that 50% similar to Chlamydomonas reinhardtii p300.

The invention also provides a method of expressing a gene in algae at higher levels, comprising:

-   -   transforming algae with at least one vector comprising:     -   an algae compatible transcriptional promoter functionally         upstream of a coding sequence for a gene expression enhancer         (GEE) fusion protein, wherein the fusion protein comprises an         algae derived p300 functionally fused to the DNA binding         protein, wherein at least the portion of the coding sequence of         the DNA binding protein domain is codon optimized for improved         expression in an algae;

at least one transgene functionally downstream of an algae compatible transcriptional promoter; and

-   -   at least one DNA region that is a binding site for the DNA         binding protein, in vicinity of at least one of said         transcriptional promoters;     -   selecting a transformed algae cell; and     -   detecting the expression of said GEE gene and/or said transgene         in algae.

In a preferred embodiment, the DNA Binding protein is the LexA binding domain, and more preferably the p300 is from Chlamydomonas reinhardtii. More preferably yet, the GEE fusion protein comprises the LexA binding domain functionally fused with the HAT domain of the p300 protein from Chlamydomonas reinhardtii.

In accordance to another embodiment, the transgene is a YFP-Bcl-x_(L) fusion protein or a Venus-Bcl-x_(L) fusion protein.

In accordance to yet another embodiment, the GEE fusion protein and said transgene are transformed in algae on separate vectors, first selecting a vector stably expressing the GEE fusion protein and then transforming the selected algae with the vector comprising the transgene.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates the features of a vector in accordance to one embodiment of the present invention. The direction of transcription is indicated by arrows. The figure indicates certain structural components, as discussed herein elsewhere. The linear drawing provides further details of the respective region of the vector: a fused LexA-p300 protein coding region and a coding region of a GOI. In this embodiment, these two coding regions are transcribed in opposite directions (thus the “bidirectional” nomenclature). These two coding regions are separated by a locus comprising LexA binding sites.

FIGS. 2A and 2B illustrate another embodiment of the invention. FIG. 2A is a vector expressing a LexA-p300 fusion protein. FIG. 2B illustrates a vector expressing another gene which is advantageously introduced in algae (“gene of interest or “GOI”). In accordance to this embodiment, each of the LexA-p300 fusion and the GOI have, at or near their 5′-ends, LexA binding site(s).

FIG. 3 compares the putative p300 protein from algae (Chlamydomonas reinhardtii (“Chlamy”) with known p300 proteins from the indicated phylogenetically representative species. The lighter colored section of each bar represents the histone acetylase (HAT) domain. The HAT domains are aligned for visualization purposes. These lighter bars include numbers that are indicative of the percent identity of the HAT domain of each protein proteins within each panel, with the indicated percentages of identity of each HAT protein to the p300 HAT domain of the p300 protein from Chlamy. The figure is drawn to scale, both in respect to the overall size of the p300 proteins and the location of the HAT domain within the protein.

DETAILED DESCRIPTION

Expression of transgenes in the algae is difficult. H. Cerutti, A.M.J. et al., The Plant Cell 9:925-945 (1997). Likewise, when the present inventors transformed a microalgae with a construct expressing a yellow florescence protein (“YFP”) fused to a cancer suppressing Bcl-x_(L) gene (the transcription driven by the rubisco promoter (rcbS2) and relying on a heat shock translational enhancer (HSP70)), the transformed microalgae failed to produce fluorescence. However, transformants which survived marginally longer and were morphological affected (the result of limited expression of the Bcl-x_(L) gene) were observed. It is expected that that gene silencing contributed to the poor expression of the transgenes in algae.

The present invention provides an effective method to increase transgene expression in algae, preferably a green algae, more preferably a microalgae. A preferred algae of the invention is an unicellular, photosynthetic algae. A yet more preferred algae is the microalgae. The GOI transgene expressed in the algae in accordance to the invention is expressed to a higher level. The expression is increased by at least 50%, preferably about two to at least five fold, relative to the expression of the same transgene engineered in the algae without the benefit of the present invention. In respect of fluorescence transgenes, the expression is increased sufficiently to allow monitoring the fluorescence signal. More preferably, the fluorescence signal is monitored in Chlamydomonas.

The transgene is introduced in algae. In accordance with an embodiment of the present invention, the transgene is placed on a vector. The vector is a nucleic acid structure used to introduce a cassette containing a DNA sequence into an algae chromosome. The vector is introduced in the nucleus of a host algae cell and the transgene is transcribed/translated in the algae. Methods of transformation of algae are well known to artisans skilled in the art. For example, a vector construct may be introduced via electroporation, via plasmid conjugation, and via particle bombardment. The transformed algae are recovered on a solid nutrient media or in liquid media. Elizabeth H Harris, Chlamydomonas As A Model Organism, Annual Review of Plant Physiology and Plant Molecular Biology 52:363-406 (2001) and EMBO Practical Course: Molecular Genetics of Chlamydomonas, Laboratory protocols. Geneva, Sep. 18-28, 2006.

A preferred vector of the invention is a plasmid capable of integrating the DNA sequence of interest into a chromosome of the algae. There are a large numbers of vectors known and characterized. A preferred vector of the invention is pSP124. Lumbreras et al., Efficient foreign gene expression in Chlamydomonas reinhardtii mediated by an endogenous introns, The Plant Journal 14(4):441-447 (1998).

Methods of engineering vectors are well known in the art. The vector backbone may include genes encoding transformation markers, to indicate transformation of the host cell with the vector. A transformation marker may be a selective marker gene used to select cells in which the vector is present from normal cells without the vector. Selective markers are well known to artisans skilled in the art. Commonly used selective markers include genes that confer resistance to specific antibiotics such as bleomycin. Only cells containing the vector grow in media containing the antibiotic. Other vector backbones may also include marker genes that merely indicate which cells were transformed. When such markers are used, cells with and without the vector will grow but the cells containing the vector can be distinguished from those not having the vector because they display a specific characteristic conferred by the vector, e.g., color. A commonly used transformation marker gene is the yellow or green fluorescence gene. Cells containing a vector with such a gene will be yellow or green. Other common transformation markers include various luciferase genes. Cells containing the luciferase genes emit light.

Any effective combination of gene expression regulatory features compatible with expression of genes in the algae nucleus can be incorporated in the vector. The plasmid may include different types of promoters, for example constitutive promoters or inducible promoters. Preferred transcriptional promoters in accordance to the invention include the hsp70 (“heat shock protein” promoter), rbcS (“rubisco small subunit” promoter) and tubA2 (“actin” promoter). The vector employs suitable translational enhancer elements, generally referred to as 5′untranslated regions or “5′UTR.” Preferred enhancers in accordance to the invention are the tubA2 intron 1, the HSP70 enhancer, and the rcbS2 intron 1. The vector of the invention includes also effective translational terminators, 3′UTR. Examples of preferred 3′-UTR sequences include the tubA2, HSP70, and rcbS2 3′UTR. Other effective promoters, transcription enhancers and terminators may, in particular combinations, may produce satisfactorily high and stable expression.

Some of these options are illustrated in FIGS. 1 and 2. The features selected to be exemplified in FIGS. 1 and 2 include the promoter and 3′UTR regions of the Chlamy genes: tubA2 encoding actin (Tubulin); rbcS2 encoding the rubisco small subunit; or nitA encoding nitrate reductase. Furthermore, the hsp70A/rbcS2 tandem promoter is a preferred driver of transgene expression. Schroda M., Beck C. F. and Vallon A., Sequence elements within an hsp70 promoter counteract transcriptional transgene silencing in Chlamydomonas. Plant J. 31:445-455 (2002). This chimeric promoter contains the enhancer region of the nucleo-cytoplasmic-localized 70 kD heat shock protein gene (NCBI GenBank ID: M76725; by 572-833) and the promoter from the nuclear rubisco small subunit gene (NBCI GenBank ID: X04472; bp 934-1142). Additionally, the first intron (bp 1307-1451) and 3′-untranslated region (bp 2401-2632) of the rbcS2 gene may be included to further promote stable transgene expression.

In accordance with an embodiment of the present invention, one or more vectors are used to introduce a cassette that contains a gene of interest (“GOI”) and a gene silencing inhibitor into the nucleus DNA of algae, e.g., a Chlamy nucleus. The GOI can be any gene desirably expressed in algae. Viable genes of interest include genes involved in controlling algae's metabolic pathways. For example, in one embodiment of the present invention the Bcl-x_(L) gene can be inserted and expressed in the algae's nucleus. Bcl-x_(L) is an abbreviation for B-cell lymphoma extra-large; it is known to be an inhibitor of apoptosis (programmed cell death). Boise L. H. et al., Bcl-x, a bcl-2-related Gene that Functions as a Dominant Regulator of Apoptotic Cell Death, Cell 74:597-608 (1993). In another embodiment genes affecting lipid or isoprenoid production pathways are desirably introduced. Due to Bcl-x_(L)'s ability to inhibit apoptosis, its expression allows algae cells to live longer. A longer lifespan for microalgae enables the use of microalgae in various industrial applications such as photobioreactors.

A gene silencing inhibitor is also introduced into the algae. A gene silencing inhibitor is a peptide that induces relaxation of nucleosomes in the algae's nucleus. Gene silencing inhibitors include histone acetyl transferases (HATs) and other peptides that modify elements of the nucleosome, causing the chromatin structure to relax and to allow transcription factors to access the gene of interest. HAT proteins and the HAT domains of p300 and of other HAT proteins are known to cause histone acetylation and can be utilized in the invention. In accordance to the invention the domain responsible for the acetylation activity or the whole protein is deployed. See Fukuda H, et al., Brief Funct. Genomic Proteomic, 5(3):190-208 (2006); Renthal W. and Nestler E. J., Semin Cell Dev Biol. 20(4):387-94 (Epub 2009); and Lin Y. Y. et al., Genes Dev., 22(15):2062-74 (2008).

One preferred embodiment of the present invention utilizes a p300 protein as a gene silencing inhibitor. More preferably, a Chlamy derived p300 protein is utilized. In a yet more preferred embodiment, the Chlamy p300 protein is the homologue detailed in FIG. 3. In a further more preferred embodiment, only the HAT domain of the Chlamy p300 gene is utilized. See FIG. 3 and relevant portion of SEQ ID NO 4.

FIG. 3 shows an alignment comparison of the Chlamy p300 with phylogenetically distinct other p300 homologues. The lighter colored section of each bar represents the histone acetylase (HAT) domain. The HAT domains are aligned for visualization purposes. These lighter bars include numbers that are indicative of the percent identity of the HAT domain of each protein proteins with the indicated percentages of identity of each HAT protein to the p300 HAT domain of the p300 protein from Chlamy. FIG. 3 is drawn to scale, both in respect to the overall size of the p300 proteins and the location of the HAT domain within the protein.

Table 1, exemplifies the highly conserved nature of the p300 proteins and particularly conserved nature of the HAT domains.

TABLE 1 Comparison of HAT domain identity within each phylogenetic clade.

- 100% - 100% - 100% - 100% V. carteri - 85% G. max - 91% A. gambiae - 92% M. mulatta - 100% O. sativa - 91% C. floridanus - 89% O. cuniculus - 100% S. bicolor - 90% R. norvegicus - 99% P. trichocarpa - 88% H. musculus - 99% Microalgae Higher Plants Insects Mammals The bolded organism at the top of each column is the representative species to which all other percent identities are based.

Indeed, the percent identity between plant and mammalian p300 homologues is also very high, typically at least about 80%. See US Patent Publication US2003/0145349. However, the homology of the Chlamy p300 homologue to other organisms is lower. Likewise, the p300 full protein of Chlamydomonas reinhardtii is 11.5% identical and further 9.9% similar to the mouse p300 protein; 9.1% identical and a further 4.7% similar to the Drosophila p300 protein; and 23.6% identical and a further 9.9% similar to the Arabidopsis p300 protein. The Chlamy derived protein has N-terminal or C-terminal regions which are shorter and dissimilar in their location visa-vie the HAT domain to these of the mammalian or plant p300 proteins. See FIG. 3. This is suggestive of proteins with overall distinct functions and phylogeny.

The various proteins p300 homologues in FIG. 1 and described herein elsewhere are:

-   C. reinhardtii p300/HAT Protein ID: 159467703 from NCBI Database. -   V. carteri p300/CBP Protein ID: 300256266 from NCBI Database. -   S. bicolor putative p300 Protein ID: C5XTZ4 from Universal Protein     Resource. -   P. trichocarpa GenBank ID: POPTR_(—)007s15090 from Joint Genome     Institute Database. -   G. max Protein ID: PF02135 from Joint Genome Institute Database. -   A. thaliana HAC1/p300/CBP GenBank ID: NM_(—)106550.3 from NCBI     Database. -   O. saliva p300/CBP Protein ID: 108792657 from NCBI Database. -   D. melanogaster CBP/HAT Genbank ID: NM_(—)079903.2 from NCBI     Database. -   A. gambiae HAT Protein ID: 158289391 from NCBI Database. -   C. floridanus CBP Protein ID: 307172990 from NCBI Database. -   M. musculus E1A/BP/p300 GenBank ID: NM_(—)177821.6 from NCBI     Database. -   O. cuniculus p300 Protein ID: 291410334 from NCBI Database. -   R. norvegicus p300 Protein ID: XP_(—)576312.3 from NCBI Database. -   M. mulatta p300 HAT Protein ID: XP_(—)001102844.1 from NCBI     Database. -   H. sapiens p300 Protein ID: NP_(—)001420.2 from NCBI Database.

In another preferred embodiment of the present invention, the gene silencing inhibitor is functionally tethered or, preferably, fused to a DNA binding protein or domain thereof (the tethered/fused protein or its/their gene hereinafter are referred to as the gene expression enhancer unit, or “GEE”). The DNA binding protein or domain binds to a particular DNA sequence (Binding Site or “BS”), bringing the gene silencing inhibitor to its histone target at a location in the vicinity of the BS and thereby inducing relaxation of the nucleosome at that genetic location. As the nucleosome relaxes, the nearby DNA sequence is exposed to transcription factors and is more actively transcribed.

In accordance to a preferred embodiment, the invention requires the expression in an algae protein that binds specific DNA sequences, which sequences can be engineered upstream of any GOI for expression in algae. The DNA binding protein/domain can be any protein having known DNA binding sites can be used. Examples of proteins targeting specific DNA motifs applicable to this invention include the Gal4 protein and Early Growth Response Protein 1. DNA binding site motifs for these proteins are known. Likewise, the binding domains of these as well as the LexA protein are known and are preferentially used, instead of the full-length protein. See for example Young, K., Biol. Reprod., 58:302-311 (1998) and Joung, J. et al., Proc. Natnl. Acad. Sci., 97:7382-7 (2000). The DNA binding site (BS) for Gal4 is 5′-CGGAGGACAGTCCTCCG-3′ (SEQ ID NO 10).

LexA is a preferred example of a DNA binding protein. LexA is a gene of bacterial origin. LexA proteins or genes are not known in algae. Thus, it is unlikely that the Chlamy genome will contain the DNA binding sequence of LexA. The function of LexA in the context of the invention is to bind a particular DNA sequence (binding site, “BS”). LexA binding sites are found upstream promoters in a number of microorganisms. A consensus BS sequence for LexA is CTGTATATATATACAG. SEQ ID NO 9. The binding domain of the LexA protein is known and, for the purpose of the invention, it is preferred to employ only the binding domain. Protein ID: 2293118 from NCBI Database: MKALTARQQEVFDLIRDHISQTGMP PTRAEIAQRLGFRSPNAAEEHLKALARKGVIEIVSGASRGIRLLQEEEEGLPLVGRVAAG EPLLAQQHIEGHYQVDPSLFKPNADFLLRVSGMSMKDIGIMDGDLLAVHKTQDVRNGQ VVVARIDDEVTVKRLKKQGNKVELLPENSEFKPIVVDLRQQSFTIEGLAVGVIRNGDWL EFPGIRRPWRPLESTCSQANSGRISYDL (SEQ ID NO 11).

As noted above, the DNA binding protein or domain thereof, preferably the LexA domain, is constructed to translate in a protein allowing the DNA binding domain and a nucleosome relaxation protein to act in concert. Any nucleosome relaxation protein might be used. Preferably, as noted above, a Chlamy p300 domain is used.

Without being limited to a single mechanism of action, it is proposed that one partner binds to the DNA, the other acetylates nearby histones, thereby creating a looser association between the DNA and the histones at that site. Therefore any method to render the DNA binding domain and the acetylase domain spatially close to each other is preferred. A fused protein is more preferred. The order of the two units (N-terminal proximity) within the fusion protein is not critical. However, in the p300-LexA binding domain example, it is preferred that LexA binding domain is at the N-terminal end of the fusion. “Functional” fusion proteins are designed. By way of example, certain linker regions are introduced to allow flexibility, orientation or simply “dead” protein sequence corresponding to strategically placed genetic engineering features such as primers and restriction enzyme sites.

Preferably, the GEE can be a p300 peptide homolog and the DNA binding domain can be LexA binding domain, creating a p300-LexA binding domain fusion protein and its gene construct. Preferably, that fusion is an algae p300-LexA binding domain fusion. More preferably, the fusion is the Chlamy p300-LexA fusion. Alternatively, the fusion comprises select domains of the Chlamy p300-LexA proteins. See SEQ ID NO 4. Yet more preferably, the fusion, at the nucleic acid level, comprises a LexA sequence modified in its codon usage for higher yield when expressed in algae. Preferably, the whole of the GEE fusion protein gene was designed for preferred codon usage in algae, even if part of the gene (p300) is an algae derived gene, as provided by SEQ ID NO 1 and SEQ ID NO 3. Indeed, the transgene (GOI) and other genes in the system preferably are codon optimized based on codon frequency in algae.

It should be noted that other algae p300 homologues or their acetylasehistone acetyltransferase (HAT) domains may be preferentially used in the invention. However, these preferred homologues must be at least about 60% identical to the Chlamy p300, preferably at least about 70% identical, at least about 80% identical, at least about 85% identical, at least about 90% identical or more. A p300 homologue from V. carteri (algae) was recently identified. It has about 85% identity to the Chlamy p300, over the HAT domains.

The LexA-p300 fusion DNA (SEQ ID 1) is the nucleotide sequence encoding a fusion protein (exemplary GEE) comprising the LexA binding domain and the full length Chlamy p300 sequence, all of the fusion designed to reflect preferred codon usage in algae. It was adapted to the nuclear codon usage of C. reinhardtii according to the table provided by the Kazusa Codon Usage Database (Species ID: 3055), using Gene Designer software from DNA 20. The sequence up to nucleotide 690 is that of the LexA DNA binding domain and the fell length C. reinhardtii p300 sequence begins at nucleotide 700. A 3-amino acid peptide linker (GVL) connects LexA binding domain and p300, which represents the DNA restriction site PpuMI (9 bp). The LexA gene sequence is codon-optimized for C. reinhardtii nuclear expression using AA sequence from Protein ID: 2293118 from NCBI Database:

SEQ ID NO 1 1 ATGAAGGCTCTGACCGCTCGCCAGCAGGAGGTGTTTGATCTGATTCGGGA 51 CCATATCAGCCAAACGGGCATGCCCCCTACGCGCGCGGAGATCGCGCAAC 101 GGCTGGGCTTCCGCTCCCCGAACGCGGCTGAGGAGCACCTGAAGGCGCTG 151 GCGCGCAAGGGTGTGATTGAGATCGTCTCCGGCGCGTCGCGGGGCATTCG 201 GCTGCTGCAGGAGGAGGAGGAGGGTCTGCCTCTGGTGGGGCGGGTGGCTG 251 CGGGCGAGCCCCTGCTGGCCCAGCAGCACATTGAGGGCCACTACCAAGTG 301 GACCCGTCCCTCTTCAAGCCGAACGCCGATTTCCTGCTGCGCGTCAGCGG 351 TATGAGCATGAAGGACATCGGCATCATGGACGGTGACCTGCTGGCCGTGC 401 ATAAGACGCAGGACGTGCGCAACGGCCAAGTGGTCGTCGCCCGCATCGAT 451 GACGAGGTGACCGTGAAGCGCCTGAAGAAGCAGGGGAACAAGGTCGAGCT 501 GCTGCCCGAGAACAGCGAGTTCAAGCCCATCGTGGTGGATCTGCGCCAGC 551 AATCCTTCACCATCGAGGGCCTGGCGGTGGGCGTGATCCGCAACGGCGAC 601 TGGCTGGAGTTCCCGGGCATCCGCCGCCCGTGGCGCCCTCTGGAGTCCAC 651 GTGCTCGCAGGCCAACTCCGGCCGCATTAGCTACGACCTGGGGGTCCTTA 701 TGGTGCCGATGGGCGCGCCCGCTATGCCCATGGGCAACAACGGCTCGCCC 751 ATGCTGAACGGCATGGGTATGTTCAACGCCCCGCAGCAGACCGTGCCCAA 801 CGGCGGGCCGGGTGGCGTGAACCCCATGGGCCAGGTGCCGGCGATGCCTG 851 CGCCGATCCCCAACGGCGGTCTGCCCGGTATGAACGCTGCCGGCGGTGCC 901 GACGATCCTGCGAAGCAGCGGGAGCAATCGATCCAGAAGCAGCAGCGCTG 951 GCTGCTGTTCCTGCGGCACTGCGCGAAGTGCCGGGCTCCCGGCGAGGACT 1001 GCCAGCTGAAGTCCCAGTGCAAGTTCGGCAAGCAGCTGTGGCAGCACATC 1051 CTGTCGTGCCAAAACCCGGCCTGCGAGTACCCGCGCTGCACCAACTCCAA 1101 GGATCTGCTCAAGCACCACCAGAAGTGCCAGGAGCAGACCTGCCCCGTGT 1151 GCATGCCGGTGAAGGACTACGTGAAGAAGACGCGCCAGGCGACCCAACAG 1201 CAGCAACAAATGCAGCAACAACAGCAAATCCAGCAACAGCAACAACAACA 1251 AATGCAACAGCAACAGATGCAACAGCAGCAGCTCCAGCAGCAGCAGATGC 1301 AACAACAACAGCAGATGCAGCAGCAGCAACAGCCCGGCGTGGGCGCCAAC 1351 TTCATGCCCACCCCGCCCATGATGCCGAACGGCATGTTCCCTCAACAGCA 1401 GCCCCAGCAGGCGATGCGCCTGAACGCCAACGGCCTCGGCGGCCAGAAGC 1451 GCCCCCACGAGATGATGGGTATGTCCAGCGGCGGCATGGACGGTATGAAC 1501 CAGATGGTGCCCGTCGGCGGCGGCGGCATGGGCATGTCGATGCCGATGGG 1551 TATGAACAACCCCATGCAGGGCGGTATGCCCCTGCAGCCTCCGCCCCAGG 1601 TGCAGGCTCCCGGTCAGGGCCCCATGATGAGCGCCCCTCAGCAGCAACAG 1651 CAGCAACCGGCCCCTAAGCGGGCGAAGACCGACGATGTGCTGCGCCAGAA 1701 CACGGGCACCAGCCTCCTGGAGACGTTCGACGCCAAGCAGATCCGCGTGC 1751 ACGTGGACCTGATCCGCGCTGCCGCGGTGACCCAGAAGGCCCAGCAGCCT 1801 CCCCCGGCTAACCCCGACGACGCGTGCAAGGTCTGCGCGCTGACGAAGCT 1851 GTCGTTCGAGCCCCCGGTGATTTACTGCTCGAGCTGCGGCCTGCGCATCA 1901 AGCGCGGCCAGATCTTCTACAGCACGCCTCCGGACCACGGCAACGACCTG 1951 AAGGGTTACTTCTGCCACCAGTGCTTCACCGACCAGAAGGGCGAGCGCAT 2001 CCTGGTGGAGGGCGTCTCGATCAAGAAGAGCGACCTGGTGAAGCGCAAGA 2051 ACGATGAGGAGATCGAGGAGGGGTGGGTGCAGTGCGACCACTGCGAGGGC 2101 TGGGTGCACCAGATTTGCGGCATGTTCAACAAGGGCCGGAACAACACGGA 2151 CGTCCACTACCTGTGCCCTGACTGCCTGGCCGTGGGCTACGAGCGCGGCC 2201 AGCGCCAGAAGACGGAGGTCCGCCCCCAGGCGATGCTCGAGGCGAAGGAT 2251 CTGCCCACGTCCCGGCTGTCCGAGTTTATTACGGAGCGCCTGAACCGCGA 2301 GCTGGAGAAGGAGCACCACAAGCGGGCTGAGCAGCAGGGCAAGCCGCTGC 2351 ACGAGGTGGCGAAGCCCGAGCCCCTGACCGTGCGGATGATCAACTCCGTG 2401 ATGAAGAAGTGCGAGGTCAAGCCGCGCTTCCACGAGACGTTCGGCCCCAC 2451 CGACGGCTACCCCGGGGAGTTCGGCTACCGGCAGAAGGTGCTGCTGCTGT 2501 TCCAAAGCCTGGACGGTGTCGACGTGTGCCTGTTCTGCATGTACGTGCAG 2551 GAGTACGGCAAGGACTGCCCTGCGCCCAACACCAACGTGGTGTACCTGTC 2601 GTATCTGGACTCCGTCAAGTACTTCCGCCCTGAGATTCCCTCGGCCCTGG 2651 GCCCTGCCGTGTCGCTGCGCACCTTCGTGTACCACCAACTCCTGATCGCC 2701 TACGTGGAGTTTACCCGCAACATGGGTTTTGAGCAGATGTACATTTGGGC 2751 GTGCCCGCCGATGCAAGGCGACGACTACATCCTGTACTGCCACCCGACCA 2801 AGCAGAAGACGCCGCGCTCGGACCGCCTGCGCATGTGGTACATTGAGATG 2851 CTGAAGCTGGCGAAGGAGGAGGGTATCGTGAAGCACCTGAGCACGCTGTG 2901 GGATACGTACTTCGAGGGCGGTCGCGACCACCGGATGGAGCGCTGCTCGG 2951 TCACGTACATTCCGTACATGGAGGGCGACTACTGGCCCGGCGAGGCTGAG 3001 AACCAGCTCATGGCCATTAACGACGCGGCCAAGGGCAAGCCTGGGACCAA 3051 GGGTGCGGGCAGCGCCCCGAGCCGCAAGGCCGGTGCCAAGGGCAAGCGCT 3101 ACGGCGGTGGCCCCGCCACGGCTGATGAGCAGCTGATGGCCCGCCTCGGT 3151 GAGATCCTGGGCGGGAACATGCGGGAGGACTTCATTGTGGTCCACATGCA 3201 GGTGCCCTGCACGTTCTGCCGCGCTCACATTCGGGGTCCGAACGTGGTGT 3251 ACCGCTATCGGACGCCGCCTGGCGCGACCCCTCCCAAGGCTGCCCCCGAG 3301 CGCAAGTTCGAGGGCATCAAGCTGGAGGGCGGTGGCCCCAGCGTGCCCGT 3351 GGGCACCGTCTCGAGCCTGACGATCTGCGAGGCGTGCTTTCGCGACGAGG 3401 AGACGCGCACGCTGACCGGCCAACAGCTGCGCCTGCCCGCTGGCGTGTCG 3451 ACCGCTGAGCTCGCGATGGAGAAGCTGGAGGAGATGATCCAGTGGGACCG 3501 CGACCCTGACGGCGACATGGAGAGCGAGTTCTTCGAGACGCGGCAGACCT 3551 TCCTGTCGCTGTGCCAGGGCAACCACTACCAGTTCGACACCCTCCGCCGC 3601 GCTAAGCACTCGTCGATGATGGTGCTCTACCACCTGCACAACCCCCACTC 3651 GCCGGCGTTCGCGTCCTCGTGCAACCAGTGCAACGCCGAGATCGAGCCGG 3701 GCAGCGGCTTTCGCTGCACCGTGTGCCCCGACTTCGACATGTGCGCCAGC 3751 TGCAAGGTCAACCCTCATAAGCGCGCCCTGGACGAGACGCGCCAGCGGCT 3801 GACCGAGGCCGAGCGCCGGGAGCGCAACGAGCAGCTGCAGAAGACCCTCG 3851 CCCTGCTGGTGCACGCCTGCGGCTGCCACAACAGCGCGTGCGGCTCCAAC 3901 AGCTGCCGCAAGGTGAAGCAGCTGTTCCAGCACGCGGTCCACTGCCAGAG 3951 CAAGGTGACCGGGGGCTGCCAGCTGTGCAAGAAGATGTGGTGCCTGCTGA 4001 ACCTGCACGCCAAGTCCTGCACCCGCGCGGACTGCCCGGTGCCGCGCTGC 4051 AAGGAGCTGAAGGAGCTGCGCCGGCGCCAAACGAACCGGCAGGAGGAGAA 4101 GCGCCGGGCGGCCTACGCCGCTATGCTGCGCAACCAGATGGCCGGCAGCC 4151 AGGCTCCGCGCCCCATGTAA.

LexA-p300 Fusion Protein (SEQ ID NO 2) is the respective protein sequence encoded by the nucleic acid sequence of SEQ ID NO 1. The LexA binding domain is the sequence up to and including amino acid 230 and the full-length p300 HAT domain sequence begins at amino acid 234. A 3-amino acid peptide linker (GVL) connects LexA binding domain and p300, which represents the DNA restriction site PpuMI (9 bp):

SEQ ID NO 2 1 MKALTARQQEVFDLIRDHISQTGMPPTRAEIAQRLGFRSPNAAEEHLKAL 51 ARKGVIEIVSGASRGIRLLQEEEEGLPLVGRVAAGEPLLAQQHIEGHYQV 101 DPSLFKPNADFLLRVSGMSMKDIGIMDGDLLAVHKTQDVRNGQVVVARID 151 DEVTVKRLKKQGNKVELLPENSEFKPIVVDLRQQSFTIEGLAVGVIRNGD 201 WLEFPGIRRPWRPLESTCSQANSGRISYDLGVLMVPMGAPAMPMGNNGSP 251 MLNGMGMFNAPQQTVPNGGPGGVNPMGQVPAMPAPIPNGGLPGMNAAGGA 301 DDPAKQREQSIQKQQRWLLFLRHCAKCRAPGEDCQLKSQCKFGKQLWQHI 351 LSCQNPACEYPRCTNSKDLLKHHQKCQEQTCPVCMPVKDYVKKTRQATQQ 401 QQQMQQQQQIQQQQQQQMQQQQMQQQQLQQQQMQQQQQMQQQQQPGVGAN 451 FMPTPPMMPNGMFPQQQPQQAMRLNANGLGGQKRPHEMMGMSSGGMDGMN 501 QMVPVGGGGMGMSMPMGMNNPMQGGMPLQPPPQVQAPGQGPMMSAPQQQQ 551 QQPAPKRAKTDDVLRQNTGTSLLETFDAKQIRVHVDLIRAAAVTQKAQQP 601 PPANPDDACKVCALTKLSFEPPVIYCSSCGLRIKRGQIFYSTPPDHGNDL 651 KGYFCHQCFTDQKGERILVEGVSIKKSDLVKRKNDEEIEEGWVQCDHCEG 701 WVHQICGMFNKGRNNTDVHYLCPDCLAVGYERGQRQKTEVRPQAMLEAKD 751 LPTSRLSEFITERLNRELEKEHHKRAEQQGKPLHEVAKPEPLTVRMINSV 801 MKKCEVKPRFHETFGPTDGYPGEFGYRQKVLLLFQSLDGVDVCLFCMYVQ 851 EYGKDCPAPNTNVVYLSYLDSVKYFRPEIPSALGPAVSLRTFVYHQLLIA 901 YVEFTRNMGFEQMYIWACPPMQGDDYILYCHPTKQKTPRSDRLRMWYIEM 951 LKLAKEEGIVKHLSTLWDTYFEGGRDHRMERCSVTYIPYMEGDYWPGEAE 1001 NQLMAINDAAKGKPGTKGAGSAPSRKAGAKGKRYGGGPATADEQLMARLG 1051 EILGGNMREDFIVVHMQVPCTFCRAHIRGPNVVYRYRTPPGATPPKAAPE 1101 RKFEGIKLEGGGPSVPVGTVSSLTICEACFRDEETRTLTGQQLRLPAGVS 1151 TAELAMEKLEEMIQWDRDPDGDMESEFFETRQTFLSLCQGNHYQFDTLRR 1201 AKHSSMMVLYHLHNPHSPAFASSCNQCNAEIEPGSGFRCTVCPDFDMCAS 1251 CKVNPHKRALDETRQRLTEAERRERNEQLQKTLALLVHACGCHNSACGSN 1301 SCRKVKQLFQHAVHCQSKVTGGCQLCKKMWCLLNLHAKSCTRADCPVPRC 1351 KELKELRRRQTNRQEEKRRAAYAAMLRNQMAGSQAPRPM*.

LexA-p300 HAT domain DNA (SEQ ID NO 3) is a nucleic acid sequence corresponding to a gene encoding the LexA binding domain-acetyl-transferase (HAT) domain of the Chlamy p300 protein. Similarly, the LexA binding domain is the sequence up to and including nucleotide 690 and the p300 HAT domain sequence begins at nucleotide 700. A 3-amino acid peptide linker (GVL) connects LexA binding domain and p300, which represents the DNA restriction site PpuMI (9 bp).

SEQ ID NO 3 1 ATGAAGGCTCTCACCGCTCGCCAACAGGAGGTCTTTGATCTGATTCGCGA 51 CCACATCTCGCAGACCGGCATGCCGCCGACCCGGGCGGAGATTGCTCAGC 101 GGCTGGGCTTCCGGAGCCCCAACGCGGCCGAGGAGCACCTGAAGGCCCTC 151 GCGCGCAAGGGGGTGATCGAGATTGTCTCCGGCGCTAGCCGCGGCATCCG 201 CCTGCTGCAGGAGGAGGAGGAGGGCCTGCCGCTGGTCGGGCGGGTCGCGG 251 CCGGGGAGCCTCTGCTGGCCCAGCAGCACATCGAGGGCCACTACCAAGTG 301 GACCCCTCGCTGTTTAAGCCCAACGCGGACTTCCTGCTCCGGGTGTCGGG 351 CATGAGCATGAAGGACATCGGCATCATGGACGGCGACCTCCTGGCGGTGC 401 ACAAGACCCAGGACGTGCGCAACGGCCAGGTGGTCGTCGCGCGGATTGAC 451 GACGAGGTGACCGTGAAGCGGCTGAAGAAGCAGGGCAACAAGGTCGAGCT 501 GCTGCCCGAGAACTCGGAGTTCAAGCCTATCGTGGTCGACCTGCGCCAGC 551 AGTCCTTCACCATCGAGGGCCTGGCCGTGGGGGTCATCCGCAACGGTGAC 601 TGGCTGGAGTTCCCCGGCATCCGGCGCCCGTGGCGGCCGCTGGAGTCCAC 651 CTGCAGCCAGGCGAACTCCGGCCGCATCTCCTACGATCTGGGGGTCCTTG 701 AGGTGGCCAAGCCGGAGCCGCTGACCGTGCGGATGATCAACAGCGTGATG 751 AAGAAGTGCGAGGTCAAGCCCCGCTTCCACGAGACGTTCGGTCCGACCGA 801 CGGTTACCCCGGGGAGTTCGGCTACCGGCAGAAGGTGCTCCTCCTGTTCC 851 AGTCCCTCGACGGCGTCGACGTGTGCCTGTTCTGCATGTACGTGCAGGAG 901 TACGGGAAGGACTGCCCGGCGCCCAACACGAACGTGGTGTACCTGAGCTA 951 CCTGGACTCCGTCAAGTATTTCCGCCCCGAGATTCCCAGCGCCCTGGGCC 1001 CTGCGGTGAGCCTGCGGACCTTCGTGTACCACCAGCTCCTGATTGCGTAC 1051 GTGGAGTTCACGCGCAACATGGGCTTCGAGCAGATGTACATTTGGGCGTG 1101 CCCCCCCATGCAGGGGGACGACTATATCCTGTATTGCCATCCCACGAAGC 1151 AGAAGACCCCGCGCTCGGACCGCCTGCGCATGTGGTACATCGAGATGCTG 1201 AAGCTGGCTAAGGAGGAGGGCATCGTGAAGCACCTGTCGACGCTGTGGGA 1251 CACCTACTTCGAGGGCGGTCGCGACCACCGGATGGAGCGCTGCAGCGTGA 1301 CCTACATCCCCTACATGGAGGGCGACTACTGGCCTGGCGAGGCCGAGTAA.

LexA-p300 HAT domain AA (SEQ ID NO 4) is an exemplary GEE protein sequence of a LexA binding domain-Chlamy p300 protein, where the Chlamy p300 is limited to the histone acetyl-transferase (HAT) domain of the Chlamy p300 enzyme. The LexA binding domain is the sequence up to and including amino acid 230 and the p300 HAT domain sequence begins at amino acid 234. The 3-amino acid peptide linker (GVL) connects LexA binding domain and p300:

SEQ ID NO 4 1 MKALTARQQEVFDLIRDHISQTGMPPTRAEIAQRLGFRSPNAAEEHLKAL 51 ARKGVIEIVSGASRGIRLLQEEEEGLPLVGRVAAGEPLLAQQHIEGHYQV 101 DPSLFKPNADFLLRVSGMSMKDIGIMDGDLLAVHKTQDVRNGQVVVARID 151 DEVTVKRLKKQGNKVELLPENSEFKPIVVDLRQQSFTIEGLAVGVIRNGD 201 WLEFPGIRRPWRPLESTCSQANSGRISYDLGVLEVAKPEPLTVRMINSVM 251 KKCEVKPRFHETFGPTDGYPGEFGYRQKVLLLFQSLDGVDVCLFCMYVQE 301 YGKDCPAPNTNVVYLSYLDSVKYFRPEIPSALGPAVSLRTFVYHQLLIAY 351 VEFTRNMGFEQMYIWACPPMQGDDYILYCHPTKQKTPRSDRLRMWYIEML 401 KLAKEEGIVKHLSTLWDTYFEGGRDHRMERCSVTYIPYMEGDYWPGEAE*.

Codon-optimized Venus gene sequence is a preferred embodiment:

SEQ ID NO 5 1 ATGGTGTCGAAGGGTGAGGAGCTGTTTACCGGTGTCGTGCCTATTCTGGT 51 GGAGCTCGACGGCGACGTCAACGGGCACAAGTTTTCGGTGTCCGGCGAGG 101 GTGAGGGGGACGCGACGTACGGCAAGCTCACGCTGAAGCTGATCTGCACC 151 ACCGGCAAGCTGCCCGTCCCCTGGCCGACGCTGGTGACCACCCTGGGCTA 201 CGGCCTGCAGTGCTTCGCCCGCTACCCGGACCACATGAAGCAGCACGACT 251 TCTTCAAGTCGGCCATGCCCGAGGGGTACGTGCAGGAGCGCACGATCTTC 301 TTTAAGGACGATGGCAACTACAAGACCCGCGCTGAGGTGAAGTTCGAGGG 351 CGATACGCTGGTGAACCGCATCGAGCTCAAGGGCATCGACTTCAAGGAGG 401 ACGGCAACATCCTGGGTCACAAGCTGGAGTACAACTACAACTCCCACAAC 451 GTGTACATCACGGCGGATAAGCAGAAGAACGGCATCAAGGCCAACTTTAA 501 GATTCGCCATAACATCGAGGACGGCGGCGTGCAGCTCGCCGACCACTACC 551 AGCAGAACACCCCGATCGGCGACGGCCCCGTGCTGCTGCCCGATAACCAC 601 TACCTCAGCTACCAGTCGGCCCTGTCCAAGGATCCCAACGAGAAGCGCGA 651 TCACATGGTCCTCCTGGAGTTCGTGACCGCCGCTGGCATCACCCTGGGCA 701 TGGACGAGCTGTACAAGTAA.

SEQ ID NO 6 is the protein encoded by the nucleic acid of SEQ ID NO 5. The Venus AA sequence:

SEQ ID NO 6 1 MVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKLICT 51 TGKLPVPWPTLVTTLGYGLQCFARYPDHMKQHDFFKSAMPEGYVQERTIF 101 FKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHN 151 VYITADKQKNGIKANFKIRHNIEDGGVQLADHYQQNTPIGDGPVLLPDNH 201 YLSYQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYK*.

SEQ ID NO 7 is a nucleic acid encoding a Venus-Bcl-x_(L) fusion of the invention. It was designed to represent preferred codon usage in algae. The sequence up to and including nucleotide 717 represents Venus. A 3-amino acid peptide linker (GVL) connects Venus and Bcl-x_(L), which represents the DNA restriction site PpuMI (9 bp). Bcl-x_(L) begins at nucleotide 726.

SEQ ID NO 7 1 ATGGTGTCGAAGGGTGAGGAGCTGTTTACCGGTGTCGTGCCTATTCTGGT 51 GGAGCTCGACGGCGACGTCAACGGGCACAAGTTTTCGGTGTCCGGCGAGG 101 GTGAGGGGGACGCGACGTACGGCAAGCTCACGCTGAAGCTGATCTGCACC 151 ACCGGCAAGCTGCCCGTCCCCTGGCCGACGCTGGTGACCACCCTGGGCTA 201 CGGCCTGCAGTGCTTCGCCCGCTACCCGGACCACATGAAGCAGCACGACT 251 TCTTCAAGTCGGCCATGCCCGAGGGGTACGTGCAGGAGCGCACGATCTTC 301 TTTAAGGACGATGGCAACTACAAGACCCGCGCTGAGGTGAAGTTCGAGGG 351 CGATACGCTGGTGAACCGCATCGAGCTCAAGGGCATCGACTTCAAGGAGG 401 ACGGCAACATCCTGGGTCACAAGCTGGAGTACAACTACAACTCCCACAAC 451 GTGTACATCACGGCGGATAAGCAGAAGAACGGCATCAAGGCCAACTTTAA 501 GATTCGCCATAACATCGAGGACGGCGGCGTGCAGCTCGCCGACCACTACC 551 AGCAGAACACCCCGATCGGCGACGGCCCCGTGCTGCTGCCCGATAACCAC 601 TACCTCAGCTACCAGTCGGCCCTGTCCAAGGATCCCAACGAGAAGCGCGA 651 TCACATGGTCCTCCTGGAGTTCGTGACCGCCGCTGGCATCACCCTGGGCA 701 TGGACGAGCTGTACAAGGGGGTCCTTATGAGCCAGAGCAACCGGGAGCTG 751 GTGGTGGACTTCCTGAGCTACAAGCTGAGCCAAAAGGGCTATAGCTGGTC 801 GCAGTTCTCCGACGTCGAGGAGAACCGGACCGAGGCCCCCGAGGGGACCG 851 AGTCCGAGATGGAGACGCCGAGCGCGATTAACGGCAACCCGAGCTGGCAC 901 CTGGCGGACTCCCCTGCCGTGAACGGCGCGACCGGCCACAGCTCCAGCCT 951 GGACGCGCGCGAGGTCATCCCGATGGCGGCCGTGAAGCAGGCCCTCCGCG 1001 AGGCCGGCGACGAGTTCGAGCTGCGCTATCGCCGCGCTTTCTCGGACCTG 1051 ACCAGCCAGCTGCACATCACCCCCGGCACGGCTTACCAAAGCTTCGAGCA 1101 GGTGGTGAACGAGCTGTTCCGCGACGGCGTGAACTGGGGTCGCATCGTGG 1151 CGTTCTTCAGCTTCGGCGGTGCGCTGTGCGTGGAGAGCGTCGACAAGGAG 1201 ATGCAGGTGCTGGTGTCGCGCATTGCGGCTTGGATGGCCACCTACCTGAA 1251 CGACCACCTGGAGCCCTGGATTCAGGAGAACGGCGGCTGGGACACCTTCG 1301 TCGAGCTGTACGGCAACAACGCTGCGGCGGAGAGCCGCAAGGGCCAAGAG 1351 CGGTTCAACCGCTGGTTCCTCACGGGGATGACCGTGGCGGGCGTCGTCCT 1401 GCTGGGCAGCCTGTTCTCGCGGAAGTAA.

Venus-Bcl-x_(L) Protein (SEQ ID NO 8) is the protein fusion encoded by the nucleic acid of SEQ ID NO 7. The underlying Bcl-x_(L) protein sequence (233 AA) is encoded by the DNA sequence GenBank ID: 20336334 from NCBI Database:

SEQ ID NO 8 1 MVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKLICT 51 TGKLPVPWPTLVTTLGYGLQCFARYPDHMKQHDFFKSAMPEGYVQERTIF 101 FKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHN 151 VYITADKQKNGIKANFKIRHNIEDGGVQLADHYQQNTPIGDGPVLLPDNH 201 YLSYQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYKGVLMSQSNREL 251 VVDFLSYKLSQKGYSWSQFSDVEENRTEAPEGTESEMETPSAINGNPSWH 301 LADSPAVNGATGHSSSLDAREVIPMAAVKQALREAGDEFELRYRRAFSDL 351 TSQLHITPGTAYQSFEQVVNELFRDGVNWGRIVAFFSFGGALCVESVDKE 401 MQVLVSRIAAWMATYLNDHLEPWIQENGGWDTFVELYGNNAAAESRKGQE 451 RFNRWFLTGMTVAGVVLLGSLFSRK*.

EXAMPLE 1 An Exemplary Vector of the Invention

FIG. 1 illustrates a construct in accordance to the invention. The starting vector is pSP124. See V. Lumbreras, D.R.S. and S. Purton, Plant J., 14(4):441-447 (1998). Features of the vector are listed in FIG. 1, i.e. the two regions indicated in FIG. 1 to be part of the backbone vector, pSP124.

None pSP124 sequences are preferably engineered as individual synthetic DNA fragments and strung together via restriction enzyme sites, by well-known techniques. Alternative approaches and mixtures of approaches are available. For example, some features are optionally introduced as PCR products or “cut and pasted” from other available constructs. Typically, sequencing and/or other assays (e.g. size analysis, hybridization) are used to verify the resultant vector.

As an example, one section of the insert is created by synthesis of a region having a BamHI site and ending with an EcoRI site (“Synthetic_(—)1”). This region provides a transcriptional enhancer region, two LexA binding motifs, a rubisco transcriptional promoter (including the first intron of rbcS2), a YFP-Bcl-x_(L) fusion protein, and a rubisco 3′UTR. The YFP and Bcl-x_(L) coding regions were designed in this instance to reflect the preferred codon usage in algae.

Another region incorporated is prepared by high fidelity PCR and effectively provides the p300 (HAT) gene (“Genomic PCR”). Flanking the genomic PCR fragment are two additional regions prepared by synthetic DNA (“Synthetic_(—)2”). The region transcriptionally upstream of the p300 gene provides the LexA binding domain coding sequence downstream of transcriptional promoters and two LexA binding sites. The Synthetic_(—)2 region provides a 3′UTR. Combined, the Synthetic_(—)2 and Genomic regions create a complete transcription unit encoding a LexA-p300 fusion protein (GEE).

Effectively, FIG. 1 and these explanations provide an example of the features of a construct of the invention and illustrate methods of creating the features within an algae compatible plasmid. Two transcriptional units face opposing directions and each have two LexA binding sites, creating an opportunity for the LexA-p300 to bind at any of four sites and affect transcription levels of either transcriptional unit. A third transcriptional unit provides a selection marker, bleomycin-resistance.

It will be recognized by a skilled artisan that other design approaches are available, including the incorporation within the vector of additional or different genes incorporated for expression, different gene expression control features, other restriction sites, change the number of LexA-BS, and so on, without changing the concept behind the creation of this vector, namely to effectively increase the levels of expression of the genes located in vicinity of a DNA-BS, in the presence of a GEE that recognizes/binds the BS.

EXAMPLE 2 Additional Exemplary Vectors

Two vectors are constructed which are in most respects identical, but for the presence of a GEE unit. The vectors are otherwise the same to each other and similar to the vector of FIG. 2A. The use of these vectors in parallel allows testing of the p300 activity and the role of LexA in otherwise identical genetic backgrounds. The use of two vectors also allows for modulation of the GEE activities by such additional engineering, for example, as addition of other genes, addition of multiple copies of GEE and so on.

Notably, “LexA BS” does not refer to any limit of the number of binding sites; anything from one BS to many BS are possibly located at the indicated position. Practically speaking, it is unlikely to utilize more than about 8 BS, as benefit from additional sites would be unlikely. Preferably, about 2 to 6 BS are located in the region at or near the 5′end of genes desirably expressed, more preferably there are 2-4 BS.

EXAMPLE 3 Characterization of GEE Efficacy with a Bidirectional Promoter

Experiment 1. Use the bidirectional construct with YFP reporter in the position of the GOI and either one of two variants of the GEE construct: [1] in which the LexA-p300 chimeric gene is driven in the opposite direction (FIG. 1) or [2] in which only LexA is driven in the opposite direction which serves as a control.

Algae are transformed with the two constructs and selected on appropriate antibiotic containing selection media (e.g. media containing bleocin). After selection, 100 colonies from transformation for each construct are chosen to analyze the expression of the YFP transgene by assaying mRNA expression using rtPCR, protein expression with Western blot, and single cell fluorescence by flow cytometry and fluorescent microscopy. The clonal populations are passaged for 2, 4, 6, and 10 generations. The frequency of high-level expression of YFP are compared between the LexA-p300 and LexA only clones. The LexA-p300 GEE increases expression and maintains a higher level of nuclear transgene expression over time.

EXAMPLE 4 Characterization of GEE Efficacy Using Distinct Plasmids

Generate two sets of stable clones: Set one is a stable cell line with the incorporated transgene encoding the LexA-p300 fusion (FIG. 2A) that is then transformed with a plasmid that expresses the YFP vector (FIG. 2B). Set two is a stable cell line with the incorporated transgene encoding the LexA only (related to FIG. 2A with the exception of the p300 fusion partner) that is then transformed with a plasmid that expresses the YFP vector (FIG. 2B).

Select for stable cell lines and characterize the YFP expression over time by assaying mRNA expression by rtPCR, western blot to determine protein expression, and assay of single cell fluorescence by flow cytometry and fluorescent microscopy. The clonal populations will be passaged for 2, 4, 6, and 10 generations. Similarly, the frequency of high-level expression of YFP are compared between the LexA-p300 and LexA only clones. The LexA-p300 GEE increases expression and maintains a higher level of nuclear transgene expression over time.

The invention described above should be read in conjunction with the accompanying claims and drawings. The description of embodiments and examples enable one to practice various implementations of the invention and they are not intended to limit the invention to the preferred embodiment, but to serve as a particular example of the invention. Those skilled in the art will appreciate that they may readily use the conception and specific embodiments disclosed as a basis for modifying or designing other methods and systems for carrying out the same purposes of the present invention.

All references, including publications, patent applications, patents, and website content cited herein are hereby incorporated by reference to the same extent as if each reference were individually and specifically indicated to be incorporated by reference and was set forth in its entirety herein.

The websites mentioned herein were last visited on Oct. 30, 2010.

The use of the terms “a” and “an” and “the” and similar references in the context of describing the invention are to be construed to cover both the singular and the plural, unless otherwise indicated herein or clearly contradicted by context.

Recitation of ranges of values herein are merely intended to serve as a shorthand method of referring individually to each separate value falling within the range, unless otherwise indicated herein, and each separate value is incorporated into the specification as if it were individually recited herein. The word “about,” when accompanying a numerical value, is to be construed as indicating a deviation of up to and inclusive of 10% from the stated numerical value. The use of any and all examples, or exemplary language (“e.g.” or “such as”) provided herein, is intended merely to better illuminate the invention and does not pose a limitation on the scope of the invention unless otherwise claimed. NO language in the specification should be construed as indicating any non-claimed element as essential to the practice of the invention. 

What is claimed is:
 1. A eukaryotic unicellular algae, comprising: a construct for enhanced gene expression in an alga cell, wherein said construct further comprises a eukaryotic unicellular algae compatible transcriptional promoter functionally upstream of a coding sequence for a gene expression enhancer (GEE) fusion protein, wherein the fusion protein comprises an algae derived p300 functionally fused to the DNA binding protein, wherein at least the portion of the coding sequence of the DNA binding protein domain is codon optimized for improved expression in a eukaryotic unicellular algae; at least one transgene functionally downstream of a eukaryotic unicellular algae compatible transcriptional promoter; and at least one DNA region that is a binding site for the DNA binding protein, in vicinity of at least one of said transcriptional promoters.
 2. The eukaryotic unicellular algae of claim 1, wherein the DNA binding protein is LexA DNA Binding domain.
 3. The eukaryotic unicellular algae of claim 1, wherein the p300 part of the GEE fusion protein is from Chlamydomonas reinhardtii.
 4. The eukaryotic unicellular algae of claim 1, wherein only a HAT domain of the p300 protein is part of the GEE fusion protein.
 5. The eukaryotic unicellular algae of claim 1, wherein the transgene is codon modified for improved expression in algae.
 6. The eukaryotic unicellular algae of claim 5, wherein the transgene is a fluorescence-Bcl-x_(L) fusion gene.
 7. The eukaryotic unicellular algae of claim 6, wherein the fluorescence-Bcl-x_(L) fusion gene is a YFP-Bcl-x_(L) fusion.
 8. The eukaryotic unicellular algae of claim 6, wherein the fluorescence-Bcl-x_(L) fusion gene is a Venus-Bcl-x_(L) fusion.
 9. The eukaryotic unicellular algae of claim 1, further comprising at least one selective marker.
 10. The eukaryotic unicellular algae of claim 9, wherein the GEE fusion protein and the at least one transgene are introduced into the system on one vector and structurally arranged to be expressed from one bidirectional promoter region and comprising DNA binding sites in the vicinity of both promoters.
 11. The eukaryotic unicellular algae of claim 9, wherein the GEE fusion protein and the transgene are introduced in the system on separate vectors, each comprising a selective marker and the selective markers are not the same.
 12. The eukaryotic unicellular algae of claim 1, wherein the algae compatible transcriptional promoters are hsp70, rbcS, nitA, tubA2 or a combination thereof.
 13. The eukaryotic unicellular algae of claim 1, wherein the GEE fusion protein comprises a DNA binding domain functionally fused to an algae derived p300 homologue having at least 80% identity over the HAT region to the p300 from Chlamydomonas reinhardtii.
 14. The eukaryotic unicellular algae of claim 13, wherein the GEE fusion protein comprises a DNA binding domain functionally fused to the HAT domain from an algae derived p300 homologue, the homologue having at least 80% identity over the HAT region to the p300 from Chlamydomonas reinhardtii. 